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Abstract 

Goal and Background. Uncertainty is commonly not taken into 
account in LCA studies, which downgrades their usability for 
decision support. One often stated reason is a lack of method. 
The aim of this paper is to develop a method for calculating the 
uncertainty propagation in LCAs in a fast and reliable manner. 
Approach. The method is developed in a model that reflects the 
calculation of an LCA. For calculating the uncertainty, the model 
combines approximation formulas and Monte Carlo Simula¬ 
tion. It is based on virtual data that distinguishes true values 
and random errors or uncertainty, and that hence allows one to 
compare the performance of error propagation formulas and 
simulation results. The model is developed for a linear chain of 
processes, but extensions for covering also branched and looped 
product systems are made and described. 

Results. The paper proposes a combined use of approximation 
formulas and Monte Carlo simulation for calculating uncertainty 
in LCAs, developed primarily for the sequential approach. Dur¬ 
ing the calculation, a parameter observation controls the per¬ 
formance of the approximation formulas. Quantitative thresh¬ 
old values are given in the paper. The combination thus 
transcends drawbacks of simulation and approximation. 
Conclusions and Outlook. The uncertainty question is a true jig¬ 
saw puzzle for LCAs and the method presented in this paper may 
serve as one piece in solving it. It may thus foster a sound use of 
uncertainty assessment in LCAs. Analysing a proper management 
of the input uncertainty, taking into account suitable sampling 
and estimation techniques; using the approach for real case stud¬ 
ies, implementing it in LCA software for automatically applying 
the proposed combined uncertainty model and, on the other hand, 
investigating about how people do decide, and should decide, 
when their decision relies on explicidy uncertain LCA outcomes 
- these all are neighbouring puzzle pieces inviting to further work. 


Keywords: Approximation formula; error propagation; life cycle 
inventory (LCI); life cycle impact assessment (LCLA); models; 
Monte Carlo Simulation; quantitative thresholds; uncertainties 


1 Uncertainties in LCAs 

The term 'uncertainties' describes the fact that measured 
values frequently do not match the true values, but differ 
from them in a probabilistic manner. In a more narrow sense, 
uncertainties are probabilistic errors of quantitative values. 



An error is defined as the difference between the measured 
value and the true value (Fig. 1). 

This definition may also be expressed in a simple formula 
(Eq. 1): 

Ax = x - x t , or x = x t + Ax (1) 

with 

Ax: Error in x 

x: Observed / measured value for variable x 
x t : True value for x 

Given that the true value is unknown for any measurement, 
it is not possible to exclude uncertainty in measured values 
completely or, in other words, to obtain error-free, certain 
values. This holds true also for quantitative data used in 
Life Cycle Assessments (LCAs). Therefore, quantitative data 
used in LCAs (material flows, etc.) is also uncertain. 

If a calculation is performed based on several data points 
which may be uncertain, the uncertainties propagate through 
the system, show in the calculated result and, provided there 
are any, also show in intermediate results, for the LCA e.g. 
in the inventory. This propagation is not easy to foresee, 
large uncertainties may result from small input uncertain¬ 
ties, while on the other hand, uncertainties from different 
input values may diminish each other. 

For identifying uncertainties, the single random error is not 
of much interest, since it will differ in every calculation due 
to its random nature. Interesting is the 'average' or 'mean' 
error that is likely to appear. For a random error of a vari¬ 
able x, this 'standard mean error' is calculated as the stand- 
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ard deviation s of the random errors Ax,, obtained from a 
series of calculations [19, p. 2], [20, p. 45] (Eq. 2): 



with 

x;. Observed value for variable x 
x t : True value for x 

m: Number of calculations performed 

For practical calculations, it is common to use the mean x 
as an estimate for the (unknown) true value [20, p. 45]: 


If a calculation is performed with uncertain data, and the 
uncertainty is disregarded, then a decision based on the cal¬ 
culation result may be made completely in the wrong direc¬ 
tion [32, p. 48]. 

Considerable work has been done in the field of uncertainties 
and LCAs. Uncertainty is addressed in LCA data formats [3- 
4,14]; uncertainty is commonly seen as an indicator for data 
quality [5-6]. Several approaches try to reduce the uncertainty 
in LCA results by identifying hot spots for uncertainty propa¬ 
gation, both in the calculation and in data used [7-9]. 

In an abstract way, the 'uncertainty problem' in LCAs can 
be seen as a simple input/output box (Fig. 2), with three 
sub-problems: 

- Problem 1: Assessing errors in input data; 

- problem 2: Assessing the propagation of errors in the calcuiation; 

- problem 3: Assessing errors in the calculation's outcome, inter¬ 
preting outcomes with inherent errors and uncertainties 



Fig. 2: The calculation of an LCA, seen as an Input/Output box, and three 
sub-problems related to the assessment of quantitative errors 


2 Approaches of Uncertainty Estimation in LCAs: 
Strengths and Weaknesses 

There are a number of proposals on how to describe the error 
propagation in a calculation. Proposals include interval calcula¬ 
tion [10], fuzzy logic approaches (e.g. [11]), Gaussian error propa¬ 
gation formulas [12], and Monte Carlo Simulation [7,9,15]. 

A Monte Carlo Simulation uses a simple yet ingenious proce¬ 
dure (e.g. [21]): It basically varies input data of the calculation 
according to a given probability distribution, runs the calcula¬ 
tion, and stores the outcome / output data of the calculation. 
This procedure must be repeated often enough (typically 


10,000 times) in order to be sure to obtain input values that 
adequately represent the selected probability distribution. 

For a function f(x) = y, one run of the simulation may be 
expressed as in Eq. 3: 

f(x) = f(x t + Ax) = y,+ Ay = y (3) 

with (for other symbols, see Eq. 1): 

Ay: Error in y 

y: Observed / calculated value for variable y 
y t : True value for y 

Performing the calculation without introducing errors gives the 
undisturbed, 'true values' for the model, while performing the 
calculation with errors in input data yields output data with 
inherent errors. Subtracting both results gives the true error in 
the outcome (see Eq. 1). By calculating the standard deviation 
from the outcome y (see Eq. 2), a reasonable estimate for the 
'average' true error, the mean standard error, may be obtained. 

Thus, the Monte Carlo Simulation is able to reveal true un¬ 
certainty in the calculation's result. In other words, the Monte 
Carlo Simulation fits well for part 2 of the uncertainty prob¬ 
lem (see Fig. 2). However, the simulation cannot correct ill- 
specified input uncertainties (part 1 of the uncertainty prob¬ 
lem), and it does not tell what to do with the uncertainty it 
calculates (part 3 of the uncertainty problem). 

As a drawback, simulating all input parameters can easily ex¬ 
ceed available computer resources and time, for a complete and 
extensive LCA. For this reason, a common approach is to select 
parameters from all possible parameters, and to simulate un¬ 
certainties only for those parameters that have, by expert judge¬ 
ment, an assumed relevance for the uncertainty in the outcome. 
This approach is implemented in LCA software [34, pp 96], 

On the other side, all other methods mentioned above have 
the problem that, especially under unfavourable conditions 
(non-linearity in the calculation, relatively high random er¬ 
rors), the uncertainty estimate given by approximation for¬ 
mulas can deviate largely from the 'true uncertainty'. In ad¬ 
dition, a control of their estimate is difficult, since 'true errors' 
and 'true values', and thus also the true value for the uncer¬ 
tainty in the outcome, are not known. 

In summary, both simulation and approximation suffer from 
a problem in uncertainty calculation: The simulation is able 
to give a reasonable estimate for the uncertainty, but in the 
common application only for previously selected parameters 
that are assumed to be most relevant; the approximation 
can only approximate the uncertainty, and it frequently can¬ 
not give a reasonable estimate. 

3 An Uncertainty Estimation Model for LCAs 

3.1 Aim of the model, and principal approach for building it 

Aim of the model. The Model has the aim (1) to investigate 
error propagation / uncertainties in LCAs, and (2) to find an 
approach for calculating the uncertainties in an LCA. In addi¬ 
tion, this approach should be efficient - as short in computing 
time as possible, as moderate in computing resource demands 
as possible - it should provide estimates for the uncertainty as 
correct as possible, and it should be applicable in practice. 
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The model has to recognise the specific drawbacks of approxi¬ 
mation and simulation (see section 2), that apply also for LCAs. 
Specifically, the approach should be able to deal with uncer¬ 
tainty in all parameters of a study (not only in selected param¬ 
eters), and it should provide 'parameters' that indicate how 
well different formulas perform in calculating the uncertainty. 
These parameters should be measurable in a real case study. 
Ciroth [16] developed the basic model in a doctoral thesis. 

Principle approach: A virtual LCA case study. A straightfor¬ 
ward approach would be to analyse an existing case study 
for uncertainties and error propagation, to implement dif¬ 
ferent algorithms for uncertainty calculation for this case 
study, and to compare and analyse the results. However, in 
an LCA case study, as in any quantitative data, the true value 
for random errors, and thus the uncertainty, is an inherent 
property of data. It is not directly 'visible' or measurable 
(see also section 1). For this reason, true values for uncer¬ 
tainties cannot be measured from a case study, they can only 
be estimated. However, these true values are essential to 
decide how well different algorithms are able to calculate 
the uncertainty (aim 2, above), and they are important to 
investigate the uncertainty propagation in LCAs (aim 1). In 
addition, one single case study does not allow a variation of 
all different parameters of an LCA (some parameters are 
fixed for the study) - which is important for the analysis of 
uncertainty propagation, and for finding suitable parameters. 

For both of these reasons, a different concept has been used 
instead: 

- We implemented a model that calculates an LCA, including ap¬ 
proximation formulas and simulation. 

- Within the model, a virtual case study was created with the aim of 
coming as close to a realistic case study as possible. Data in the 
case study was generated according to probability distributions 
for true values and random errors. 

- This case study was analysed by means of explorative data analy¬ 
sis in order to find relevant parameters. 

- A subsequent step investigated relevant parameters in order to 
condense the number of parameters further, and to identify thresh¬ 
old values for the remaining parameters. In this step, parameters 
for single calculation steps of an LCA were varied systematically, 
overwriting case study data. 

Fig. 3 shows the sequence of interlinked calculations imple¬ 
mented in the model. The model covers inventory calcula¬ 
tion, impact assessment calculation, and some possible ele¬ 
ments of valuation (normalisation, and a calculation of the 
difference between two scenarios). 



Fig- 3: The calculation of an LCA as a sequence of calculation steps, as 
implemented in the model, with qualitative and quantitative data input. 
Capital / number pairs (A1.1) refer to elements of the model 


uncertainties are squared, and a square root is extracted 
from the whole sum (Eq. 4): 


s. 


l£- kf 




(4) 


with 

S y : Random error estimate in y (calculation result); 

S x : Random error estimate in x k (input value); 

k 

: Partial derivative of the calculation function f. 
ox k 


Uncertainty calculation methods in the model. For each of 
the calculation steps, the model contains calculation and 
approximation formulas derived in Ciroth's thesis [16]. From 
several approximation formulas investigated, the following 
two were most promising: 

• The 'error propagation formula', originally by Gauss, in 
which the random error (or uncertainty) in a calculated 
figure y, y = f(x v x 2 , ..., x M ) is given by the sum of the 
partial 1 st order derivatives of this function, multiplied 
by their uncertainty estimates. Both derivative and input 


• A higher order extension of the Gaussian error propaga¬ 
tion formula, given by Bader and Baccini [18, pp 108] 
(Eq- 5): 

s y =yf{s m yf +(s <2 >v) 2 (5) 

with 

^f_' L y —> the Gaussian formula for error 
v 3jq J x ‘ propagation, see Eq. 4 
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—» the extension of the Gaussian formula. 
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Note that the Bader / Baccini extension spans matrices of 
the input values' derivatives and uncertainty estimates, with 
indices k and 1 for columns and lines, respectively, and M as 
the number of input values. 

These results from approximation formulas are compared 
to results obtained from a Monte Carlo Simulation, which 
is able to provide estimates for the true uncertainties (see 
also section 2), also within the LCA model, and may hence 
serve as a reference. 

Algorithm used in the model for calculating the inventory, 
impact assessment, and valuation. Algorithms for the calcu¬ 
lation of LCAs are not generally accepted, and frequently 
overlooked in LCA papers [27, p. 3]. There are at least two 
different ways for calculating an LCA: A 'network approach', 
also called 'sequential method' [27, p. 100], which can be 
extended to a Petri net calculation [23-26], on the one hand, 
and a matrix approach [12,27], on the other. 

The matrix approach solves a linear equation system simul¬ 
taneously, and, according to Heijungs and Suh, requires the 
performance of a matrix inversion for solving the linear equa¬ 
tion system As = f [27, pp 20], 

A number of different algorithms exist for inverting a matrix. 
For practical applications, Heijungs and Suh recommend an 
implementation of different inversion algorithms for LCAs to 
be able to choose from and compare to each other [27, p. 198], 

However, performing a matrix division is sufficient for solv¬ 
ing the linear equation system, and numerically preferable 1 
[13,29-30]. For performing a matrix division, generally the 
Gauss elimination algorithm, either in its pure form or in 
optimised modifications like the LU decomposition, is pre¬ 
ferred [30]. Both matrix division and matrix inversion re¬ 
quire a square matrix; in cases where it is not possible to 
have a square matrix A, a solution may be obtained by mini¬ 
mising the norm of the difference vector ||As-/%, following 
the method of Gaussian squares [40]. 

The network approach, on the other hand, typically starts 
at one 'central' process; from this process, flows exchanged 
with other processes are calculated (amount of flows for 


1 A citation from the Matlab Function reference: "In practice, it is seldom 
necessary to form the explicit inverse of a matrix [by inv]. A frequent 
misuse of inv arises when solving the system of linear equations. One 
way to solve this is with x = inv(A)*b. A better way, from both an execu¬ 
tion time and numerical accuracy standpoint, is to use the matrix division 
operator x = A\b. This produces the solution using Gaussian elimination, 
without forming the inverse" [29]. 


products and pre-products); the connected processes need then 
be scaled with a linear factor, the scaling factor, according to 
their contribution to the product system; process balances (typi¬ 
cally taken from a database) are multiplied with the scaling 
factor for each process; the aggregation of all process balances 
in the product system is the inventory. Flows that are exchanged 
between processes vanish in the inventory, which ideally con¬ 
tains only elementary flows. These elementary flows are clas¬ 
sified into impact categories, multiplied by characterisation 
factors specific for each category, and the products are finally 
aggregated within each impact category. The result is an indi¬ 
cator value per impact category [22, pp 593], [16, pp 17]. 

The approximation formulas used in this paper refer to the 
sequential approach. Formulas used in the model are given 
in the Appendix. 

3.2 Virtual case study, EDA, and systematic analysis 

The virtual case study was built in the following manner: Its 
product system consisted of two linear chains of 52 proc¬ 
esses each (Fig. 4, n = 52), one chain for upstream and one 
for downstream processes. Some processes appear multiple 
times in the product system, which corresponds to processes 
like transport via truck, in LCA studies. In summary, there 
are 60 unique processes in the system. Processes in the proc¬ 
ess chain were grouped in these unique processes by assign¬ 
ing uniformly distributed numbers between 1 and 60 to them. 
As a result, every process appears between one and five times 
in the process chain. Every process has a true value for one 
input flow in its balance, linking to one other process, and 
for one output flow, linking to one other process. These val¬ 
ues were obtained following a normal distribution. With 
the exception of the utmost processes in the chain, each proc¬ 
ess is linked to two other processes via product flows, and 
each of the process balances contains between 3 and 20 el¬ 
ementary flows. The number of elementary flows per unique 
process follows a uniform distribution. Ten different impact 
categories were calculated. Via random number generation, 
each category was assigned between 3 and 30 elementary flows 
of the overall 495 elementary flows, following a uniform dis¬ 
tribution. The resulting structure of the model (processes with 
flows and types of elementary flows, elementary flows assigned 
to impact categories) was not changed in the simulations. 

In addition, each product flow - flow between processes - 
and elementary flow was assigned a normally distributed 
true value, and a normally distributed error. True values for 
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characterisation factors and for reference values used in the 
normalisation (step Cl) were generated following a normal 
distribution. In the simulation, we varied the error to get 
variations in the true value or, in terms of Eq. 3, we varied 
Ax to obtain variations in y, and thus, via the standard de¬ 
viation in y, the Monte Carlo Simulation's estimate for the 
uncertainty in y. In every case, the probability distribution 
of the error in the simulation remains the same as in the 
initial generation of the error. Each Monte Carlo Simulation 
took 10,000 runs. For example, impact category '7' was as¬ 
signed elementary flows 49, 77, 123, and 147, and elemen¬ 
tary flow 147 has a true value of 3.76 and an error of about 
0.45. The latter, the error, was varied in the simulations ac¬ 
cording to the probability distribution described above. For 
further details of the virtual case study, please refer to [16]. 

As a result, the case study model consists of a linear chain of 
processes; it calculates an LCA from inventory to impact 
assessment to normalisation and scenario comparison. Er¬ 
rors in quantitative data which it uses are kept apart from 
true values, and both errors and true values are observable 
in every calculation step. 

Explorative Data Analysis EDA. Aim of the analysis is to 
explore the virtual case study for relevant patterns and pa¬ 
rameters, which indicate whether uncertainty estimates given 
by approximation formulas are close to those of a Monte 
Carlo Simulation. The analysis thus follows the idea of ex¬ 
ploratory data analysis (EDA) 2 . We plotted, sorted and clus¬ 
tered the data calculated for each step separately, and for 
the whole LCA calculation procedure as well. It helped that 
the virtual case study provided a broad range of parameter 
constellations, including unplanned ones, due to the random 
generation of input data. Investigated parameters comprise 
the amount of flows exchanged between processes (flows in 
technosphere), the number and amount of all elementary flows 
and of all characterisation factors, the amount of the refer¬ 
ence values used in normalisation; in fact, the number of times 
each process appears in the process system, for both true val¬ 
ues and errors, and the position of each process in the chain, 
i.e. if the calculated uncertainty is based on input data alone 
(process 1 in Fig. 4) or comes with a high degree of estimated 
uncertainty based on previous processes. 

Main findings of the EDA were: 

- The absolute amount of the true value, and of the true error, has a 
high influence on the performance of the approximation formula. 

- The Gauss formula is always more vulnerable to high uncertain¬ 
ties than the Bader Baccini formula. 

- A problem in the application of approximation formulas is that they 
need to build upon the uncertainty they calculate in preceding cal¬ 
culation steps. Since they do not manage to provide a correct es¬ 
timate, the following estimate gets worse. Thus, there is a 'meta 
error propagation' (error propagation in calculating the random 
error) for approximation formulas, and the error in the error calcu¬ 
lation is an important parameter for the performance of the for¬ 
mula in a next calculation step. 


2 Explained by Tukey as "exploratory data analysis is detective work. [...] 
restricting one's self to the planned analysis - failing to accompany it 
with exploration - loses sight of the most interesting results too frequently 
to be comfortable" [36, p. 3], 


- Some of the multitude of parameters appeared to have more rel¬ 
evance for uncertainty propagation and calculation. For step A1.1, 
for instance, the input uncertainty, the absolute amount of flows, 
the relative uncertainty in these flows, and the length of the proc¬ 
ess chain were more relevant by far than other parameters. 

Systematic analysis of selected parameters. In a second step, 
we tested each of those parameters more thoroughly that 
appeared to be relevant in the previous analysis. To find 
thresholds for each relevant parameter where the approxi¬ 
mated uncertainty starts to deviate from the Monte Carlo 
uncertainty, the value of each parameter was varied con¬ 
tinuously within intervals. As in the virtual case study, val¬ 
ues from the approximation formulas were compared to 
simulation results. The analysis was no longer based on each 
single LCA calculation step, but on types of calculation steps, 
grouping those with identical formulas and similar param¬ 
eters, such as step A2.1 and B2.1 for example. 

The analysis served to condense the number of relevant pa¬ 
rameters. Some of the parameters investigated proved not 
to be relevant themselves, but mere indicators of other pa¬ 
rameters or effects. For example, the number of processes 
preceding a process seemed, in the virtual case study, a highly 
important parameter for step Al.l. A result of the system¬ 
atic analysis is, in fact, that the error in the calculated input 
uncertainty is the only relevant parameter, see the following 
section for a specific example. 

3.3 Results of the model analysis 

The two-step procedure provided results for every calcula¬ 
tion step shown in Fig. 3. As an example, Fig. 5 shows re¬ 
sults for Al.l, the calculation of scaling factors, from the 
systematic analysis, with input uncertainty and the length 
of the process chain as varied parameters. For small input 
uncertainties (relative uncertainty 0.01, Fig. 5a), nearly no 
difference between the compared methods is visible. Rais¬ 
ing the input uncertainty to 0.1 changes the true uncertainty 
from a digressive shape to a linear function (see also the 
fitted functions in the boxes in Fig. 5). The Bader / Baccini 
formula now also produces a linear function, but with a 
lower slope, while Gauss still has a digressive function (see 
Fig. 5b). Consequently, calculated uncertainty estimates dif¬ 
fer considerably with a longer process chain, and the differ¬ 
ence for the Gauss formula is higher. With a relative input 
uncertainty of 0.2, the picture changes dramatically; the 
Monte Carlo Simulation uncertainty now has a shape that 
is best met by a third degree polynomial, with extremely 
high values for relative uncertainty of up to 70, while both 
approximation formulas do not change the shape of their 
plots (see Fig. 5c). 

For interpreting these figures, the following thoughts are 
useful: For each process k, the input uncertainty can be sepa¬ 
rated (i) into the uncertainty in parameters of process k, 
namely the uncertainty in the flows to other processes 
mOutput k and mlnput k , and (ii) into the uncertainty in the 
scaling factor p of the preceding process k-1. Within the 
parameter values analysed for the results, the propagating 
uncertainty of the scaling factors is far more important for 
the uncertainty of the scaling factor of process k than the 
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Fig. 5: Results for the calculation of the process scaling factors, with relative uncertainty in input parameters of 0.01 (Fig. 5a), 0.1 (Fig. 5b), and 0.2 (Fig. 5c) 
with: n: number of each process in the process chain; s p (n)/p(n): relative uncertainty in scaling factor for process n 


parameter uncertainty. For process n = 50 in Fig. 5b, for 
example, the propagating relative uncertainty (uncertainty 
from process 49) is 2.27, and the parameter uncertainty (from 
mOutput 50 ) is 0.1. In Fig. 5a, the propagating uncertainty 
for process 50 is 0.10, the input uncertainty is 0.01, and the 
Gauss formula gives a reasonable uncertainty estimate. Ob¬ 
viously, the number of processes in the chain is not directly 
important for the performance of the approximation for¬ 
mula, since it is identical for both cases in process 50. Im¬ 
portant is the relative input uncertainty, expressed as the 
relative uncertainty of the scaling factor of the preceding 
process. If the relative input uncertainty here is below 0.2, 
the Gaussian approximation formula may be used. If it is 
below 0.8, the Bader / Baccini formula gives a reasonable 
estimate for the uncertainty, and above, the Monte Carlo 
simulation should be used. The thresholds can be read from 
the plots of the results. In addition, we consulted the calcu¬ 
lated data, and additional plots of the relative difference of 
approximation formula and determined the relative uncer¬ 
tainty where the approximation formula result equals the 
simulation result, with a tolerance of 10%, which we 
rounded down to be on the safe side. Fig. 6a and 6b corre¬ 
spond to the example shown in'Fig. 5b. It plots the relative 


difference for Gauss (Fig. 6a) and for Bader Baccini (Fig. 6b) 
over the relative uncertainty, with a relative input uncer¬ 
tainty of 0.1 (the same as in Fig. 5b). It also shows the 10% 
interval and, in the thesis, it motivated the 20% threshold 
for Gauss (the interval is exceeded at a relative uncertainty 
if 0.26, which is rounded down to 0.2), and the 80% thresh¬ 
old for Bader (here the interval is left at 0.8 which is then 
also the threshold). Note that the simulation does not have 
a function as a result. The regression curves are only esti¬ 
mates with no more information than plots; in Fig. 5b, for 
example, regression curves for Gauss and Monte Carlo simu¬ 
lation do not meet at all (this holds also if the real Gauss 
approximation formula is taken, which is not shown in the 
plot). Hence, the way to interpret plots instead of analysing 
functions is the most direct one; it allows one to deal with 
the raw, non-modified data. 

For every calculation step in the model, the relative uncer¬ 
tainty (in statistical expression: the variation coefficient) turns 
out to be the determining parameter for the goodness of fit of 
the uncertainty as calculated by the approximation formulas. 

A second major result is that both approximation formulas 
manage, within the model, to calculate the uncertainty cor- 
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Fig. 6: Relative difference for Gauss (6a) and for Bader Baccini (6b) over relative uncertainty, for A1.1, with a relative input uncertainty of 0.1 
(the same as in Fig. 5b). s p (n)/p(n): relative uncertainty in scaling factor for process n; the relative difference is calculated; for Gauss as 
(s p(n)Monte Carlo - sp(n)Gauss) / sp(n)Monte Carlo *100%, and for Bader as (s p(n)Monte Carlo - sp(n)Bader) / sp(n)Monte Carlo *100% 


rectly for every calculation step, unless the relative error that 
they calculate does not exceed certain thresholds. The Bader 
/ Baccini formula generally has higher thresholds. Those 
calculation steps that use only addition and substraction 
(A1.2; A2.2; B2.2; C2), always give a reasonable estimate for 
uncertainty, hence there are no thresholds. For the other cal¬ 
culation steps, a threshold was determined similar to the ex¬ 
ample given above. Table 1 summarises the thresholds for each 
calculation step. Note that thresholds need to be applied on 
the calculation result, see Fig. 7, which generalises the thoughts 
above on the calculation of process scaling factors. 

More sensitive calculation steps generally have lower thresh¬ 
olds. The results summarised in Table 1 allow recommend¬ 
ing an approach for estimating uncertainties in LCA results; 
Start the calculation with approximation formulas, Gaussian 
or, preferable if implemented, the Bader / Baccini formula, 


propagated 

uncertainty 


input 

uncertainty 


calculation 
x -1 


calculation x 


thresholds apply 
here 


output 

uncertainty 


Fig. 7: Input uncertainty, propagated uncertainty, output uncertainty, and 
the application of the thresholds, per calculation. 'Calculation' means: 
Calculation step (see Fig. 3), or also sub-steps within a, sequentially cal¬ 
culated, calculation step, like the process scaling factors in step A1.1 


and check during the calculation for thresholds according 
to Table 1. If these thresholds are exceeded, either change 
from Gauss to Bader / Baccini or from Bader / Baccini to a 
Monte Carlo Simulation, which may also be used 'piecewise', 
for single calculation steps, or even for sections within one 
calculation step, e.g. for a part of the process chain. 


3.4 Consequences of a branched and looped product system 

The model as described in the previous chapters is based on a 
linear chain of processes as the simplest possible form of a 
product system. Practical examples for product systems are 
usually more complex, they include branches of process chains 
and loops of processes. How does a more complicated struc¬ 
ture affect the interpretations gained from the model? 

Fig. 8 shows a branched chain of processes (left) and a loop 
within a process chain (right). The boxes represent proc¬ 
esses; the sequential calculation starts from process 1 and 
continues to higher numbers. 

As indicated in the figure, a branched chain can be split into 
two separate linear chains, which are identical before the 
branching process (in the picture process number 3). Calcu¬ 
lation and results for each of the branches that are now di¬ 
rectly connected to the 'stem' are similar to the linear chain 
of processes represented in the model. The stem (processes 
1 to 3 in Fig. 8, on the left) needs to be calculated only once 
since it is identical in both cases. As a conclusion, it is not 


Table 1: Thresholds for the use of approximation formulas for estimating the uncertainty in the calculation steps of an LCA 


Calculation step 

{ Uncertainty calculation method: recommended use until 


Gauss 

Bader / Baccini 

Monte Carlo 

A 1.1 

Variation coefficient in input scaling factor < 20% 

Variation coefficient in input scaling factor < 80% 

No threshold 

A1.2 

No threshold 

No threshold 


A2.1 

Variation coefficient in scaling factors and in 
elementary flows < 40% 

Variation coefficient in scaling factors and in 
elementary flows < 100% 

No threshold 

A2.2 

No threshold 

No threshold 

No threshold 

B2.1 

Variation coefficient in characterisation factors and 
aggregated elementary flows in inventory < 40% 

Variation coefficient in characterisation factors and 
aggregated elementary flows in inventory < 100% 

No threshold 

B2.2 

No threshold 

No threshold 

No threshold 

Cl 

Variation coefficient In nominator < 400% 

Variation coefficient in denominator < 60% 

Variation coefficient in nominator < 400% 

Variation coefficient in denominator < 1000% 

No threshold 

C2 

No threshold 

No threshold 

No threshold 
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Fig- 8: Illustration for the linearization of branches (left) and loops (right) in the process chain, further explanation see text 


necessary to split branches into separate linear chains for 
the calculation, since they can be calculated in the same way 
as a single linear chain or stem. Therefore, results for the 
linear chain are also valid for branched process chains. 

The case of loops in the process chain is more complicated. 
The figure shows that a loop can be written as a sequence of 
branches. Unlike a normal branch, these branches contain 
many identical processes. Another point to consider is the 
convergence of the loop, which is essential for being able to 
calculate any reasonable uncertainty figure. The calculation is 
hence not completely trivial, and would need more space for 
an explanation. Basically, the system may be calculated as a 
branched system, and results for processes that appear multi¬ 
ple times (e.g. process 4 in the linearised system in Fig. 8) are 
obtained by adding the results for each of the appearances 
of the process. Recent tests show that the thresholds pro¬ 
vided in Table 1 are valid also in this case [2]. 

3.5 Applicability and limitations of model and results 

Results of the paper are the threshold values, the recom¬ 
mended approach for uncertainty estimation, and the ap¬ 
proach of a virtual uncertainty case study. 

Threshold values are developed for a linear chain of proc¬ 
esses, but recent findings show that they also apply for more 
complex product system structures, see section 3.4. They de¬ 
pend to some degree on the calculation algorithm employed 
in the model. Both sequential and matrix algorithms may be 
closer than it seems at first glance; the Gaussian Elimination 
algorithm, for example, requires a repetitive calculation of 
quotients of elements of matrix A, which is similar to the quo¬ 
tients needed for process scaling factor calculation in the se¬ 
quential approach, and the term 'matrix inversion' is some¬ 
times, by Heijungs and Suh as well, used as a synonym for 
solving the equation system (without meaning to calculate the 
inverse of the matrix) [13]. At present, the threshold values 
are limited to the sequential approach. Changing the prob¬ 
ability distribution from normal to log normal did not change 
threshold values in Ciroth's thesis [16]. The model does not 
explicitly cover correlation among data. Data correlations may 
be relevant if they concern variables used simultaneously in 
the calculation. For covering uncertainty, a correlative term 
needs to be added to the approximation formulas, and the 
correlation needs to be considered in the random number gen¬ 
eration in the simulation. If strong correlation between data 
exists, an influence on the thresholds seems probable. 

The recommended approach for uncertainty estimation does 
not depend on specific threshold values, but it is most in¬ 


teresting for high thresholds, i.e. in cases where the approxi¬ 
mation formulas are able to provide sound estimates. 

The approach of investigating uncertainties by the use of a 
virtual case study, and by the use of explorative data analysis 
and a subsequent systematic analysis serves to condense the 
large number of parameters, to study the behaviour of calcu¬ 
lation algorithms under uncertainty, and allows one to set true 
values, and true (random) errors. It is applicable for other 
types of systems, algorithms (matrix algorithms, or damage 
functions in impact assessment), and data distributions. The 
use of virtual data is recognised and well accepted in other 
scientific fields like biological modelling (e.g. [39]), while it 
has not yet been used often for LCAs. An example is [38]. 

4 Conclusions 

Integrating approximation formulas and a Monte Carlo 
Simulation approach in a model that calculates an LCA based 
on virtual data enables to separate clearly true values and 
errors, to analyse properly the uncertainty propagation in 
an LCA calculation, and to investigate how well, and under 
different conditions, the approximation formulas cope with 
estimating the propagated uncertainty. 

The calculated relative uncertainty, the variation coefficient, 
turns out to be the parameter that dominantly determines 
the performance of the approximation formulas (Gauss, 
Bader and Baccini) in the model. The paper presents quanti¬ 
tative thresholds for the variation coefficient, for each cal¬ 
culation step of an LCA, and for the Gauss and the Bader 
and Baccini formula. If an approximation formula calcu¬ 
lates a variation coefficient below these thresholds, the model 
shows that the estimate of uncertainty given by the formula 
is good, and equal to a simulation result. Consequently, ap¬ 
proximation formulas can use the variation coefficient to 
self-check their estimate of uncertainty. It is possible to con¬ 
trol the goodness of the estimate during the calculation, 
which was one of the aims of the model, see section 3. 

It is then straightforward to combine approximation for¬ 
mulas and the Monte Carlo approach in the uncertainty 
calculation, combining their benefits without the downsides 
(which are a questionable correctness for the approxima¬ 
tion formulas, and, sometimes, an excessive demand for time 
and computer resources for the simulation). This combina¬ 
tion is proposed in further detail in the paper. 

The model and approach limits itself to the 'technical side' 
of uncertainty, namely the calculation, disregarding where 
the uncertainty stems from. It calculates a linear chain of 
processes, which can be broadened to branched and even 
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looped structures without impeding the results obtained, as 
outlined in section 3.4. It does not explicitly consider corre¬ 
lated data; applying its results, thus, has the condition that 
no such correlation exists to a relevant amount. 

If the latter condition holds, results seem broadly applicable 
for LCA calculations. The approach to virtually calculate 
and investigate the uncertainty propagation proved to be of 
utmost benefit and convenience, and we recommend its use 
for other, similar problems. 

Several possible paths for further work come in sight, includ¬ 
ing an analysis on how to manage the input uncertainty, in¬ 
cluding suitable sampling and estimation techniques, using the 
approach for real case studies, and implementing it in LCA 
software for automatically applying the proposed, combined 
uncertainty model. For broadening the applicability, investi¬ 
gating correlations in data and their impact on the results has 
promise, and further exploring the shortly mentioned ap¬ 
proaches for relevance estimation in Life Cycles will give light 
to an often-asked practical question. Investigating for an opti¬ 
mal calculation code for LCAs, also regarding the different 
sensitivity to uncertainty, the differing condition number of 
the algorithms, would bring more light in the proper design 
and selection of a calculation algorithm. Finally, on the out¬ 
put side of the uncertainty problem (see Fig. 2), an analysis of 
best ways to manage uncertainty in the outcome of an LCA is 
necessary, for fostering a sound decision support, and for possi¬ 
bly lowering psychological barriers of stakeholders, decision 
makers, LCA experts and practitioners in dealing with uncer¬ 
tainty in LCAs. And last but not least, we should further ex¬ 
plore possible reasons for the occurrence of uncertainty in LCAs. 
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Appendix: Calculation formulas and approximation formulas used 
Note: The Bader / Baccini extension uses the general term 

the first term under the root being the Gauss approximation: s<'iy = sy Gauss | n th e following, only the second term under the square root, the 
extension s< 2 >y, is given to avoid repetitions, if not mentioned otherwise. 


Step 


A1.1 


Explanation and reference: For a chain of processes, the calculation starts at a 'central' process, and scales the linear factors of the other processes in 
the chain according to that process, processes before the central process (‘pre‘) and processes after the central process ('post 1 ) [22, pp. 405] [16]. 


Calculation formula 

A mOutput , 

Pn,d 1 1 


, =1 mlnputj 

with - p n : Scaling factor of process n; mOutputi: Output exchange of process l; mlnputj: Input exchange of process i 


-TT m ^ n P ut i \ 
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Bader / Baccini 
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( 1 ) Gauss 

S S D , for both types of processes, 'pre' and 'post'. 
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“ ? -> J / j / -j \ u kl “ mOutput k “ mOutput , ^ mOutput k ^ mlnput , 15 mJnpui k “ mOutput, u kl ** mOutput , ^mOuipui/ J 

V /»=i 

with aw, bid, c Wl d w being elements of square matrices A, B, C, D, with indices k for columns and I for lines, respectively, 
with - n: the number of the process in the process chain; M: the total number of parameters, for this process 
The matrices themselves are defined as follows: 

- for 'downstream' processes: 
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- for 'upstream' processes: 
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A 1.2 


Explanation and reference: A unit process may appear several times in the product system (take e.g. the unit process for electricity generation). It is more efficient 
to aggregate the scaling factors for these processes before multiplying them with the process balances, which is done in the subsequent step A2.1 [16]. The 
aggregation needs not care for the location of the process in the process system, a distinction between 'pre' and 'post 1 processes is no longer necessary. 


Calculation formula 

k 

p n =Y.p™ 

m— I 

with - p n : Scaling factor of process n; p nm : Scaling factor of the m-th occurrence of process n 
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Step 



Gauss 

Gauss 

tu 

m—\ 


Bader / Baccini 

In this step, the Bader Baccini extension s (2) y is zero since higher derivates of the calculation formula are zero. 

A2.1, 

A2.2 

Explanation and reference: The linear, aggregated scaling factors for each process are multiplied with the process balances, which contain 
elementary input and output flows (step A2.1). The product is then aggregated per elementary flow over all processes. Result is the inventory 
balance of the product system, with entries for each elementary flow (step A2.2) [16]. 


Calculation formula 

k 

m i =YjPn m n, 

n =1 

with — p n : Scaling factor of process n; m,*: Elementary flow i in process n; nr*: Elementary flow i 


Gauss 
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ni 

litjk fk t+Alk kk fkf) 

k= 1 /=! Z 

B1 

Classification is a loqical assignment without calculation formulas. 

B2.1 ; 

B2.2 

Explanation and reference: Elementary flows as result from the inventory calculation are multiplied with applicable characterisation factors for 
obtaining indicator values in impact categories (step B2.1), in step B2.2, these indicator values per category are aggregated. [16] 


Calculation formula 

k j=Y J m i c ij 

i 

with - kj: Indicator value in impact category j; c*: Indicator value in category j for elementary flow m t 


Gauss 


Gauss 

IH( c ij- s J 2+ i m r s cJ 

i 


Bader / Baccini 


k =1 1=1 Z 

Cl 

Explanation and reference: The normalisation step sets the aggregated indicator values per impact category in relation to reference values [16], [31] 


Calculation formula 

' k Jref 

with - Zf. Normalised indicator value in category j; kf. Indicator value in category j; k jre f: Reference indicator value for category j 
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C2 

Explanation and reference: This step simply subtracts the normalised results of one scenario from those of another scenario, per impact category [16], [31] 


Calculation formula 

^ Delta j a ^ j a ref 

with - z Dana j Difference of scenario a and reference scenario, for category j; z&: Normalised indicator result for category j, scenario a; 

Zj re {! Normalised indicator result for category j, reference scenario 


Gauss 


2 Delta i a 



Bader / Baccini 

In this step, the Bader Baccini extension s (2) y is zero since higher derivates of the calculation formula are zero. 
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